New method of extracting non-Gaussian signals 
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We propose a new general method of extracting non-Gaussian features in a given field. It sets the 
mean of the field to zero and renormalizes its Fourier (or multipole) power to white noise, while 
keeping the phase information unchanged. Using simulated cosmic microwave background (CMB) 
as an example, we demonstrate the power of this method under many challenging circumstances. 
In particular, we show its capability of detecting cosmic strings in the CMB. 
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A. Introduction: Searching for and characterizing the 
non-Gaussianity (NG) of a given field has been a vital 
task in many fields of science, because we expect the 
consequences of different physical processes to carry dif- 
ferent statistical properties. To illustrate this point, we 
shall employ the cosmic microwave background (CMB) 
as an example, which features several prominent as- 
pects of our interest. 

With the cosmological principle as the basic premise, 
there are currently two main competing theories for the 
origin of structure in the universe — inflation 0] and topo- 
logical defects ||[| ■ Although the beauty and simplicity 
of the former appears to have enticed more adherents, 
the latter can still coexist with the former. In particu- 
lar, the observational verification of defects will have cer- 
tain impact to the grand unified theory, because they are 
an inevitable consequence of the spontaneous symmetry- 
breaking phase transition in the early universe. In ad- 
dition to the conventional study of the power spectra 
of cosmological perturbations, another way to distin- 
guish these models is via the search for intrinsic NG in 
the perturbations — while the standard inflation predicts 
Gaussianity, theories like isocurvature inflation || and 
topological defects provide NG. 

The cosmological test for NG can be implemented in 
both large-scale structure (e.g., and CMB (e.g., 0). 
The former is less favored because its intrinsic statisti- 
cal properties can be easily distorted by the late-time 
non-linear gravitational interaction. Even in the use of 
CMB, there are still several observational and theoretical 
challenges. For example, the intrinsic statistical features 
of the CMB anisotropies may be obscured by the fore- 
ground contamination (from the radio emission of our 
own Galaxy, distant galaxies, etc.), instrumental noise, 
and sample variance; the central limit theorem (CLT) 
will force the large-scale perturbations to be Gaussian 
even if very strong NG mechanisms operate on smaller 
scales. Adding the fact that high-precision and high- 
resolution observation is yet available, various Gaussian- 
ity tests of the CMB data in the literature have so far 
drawn no robust conclusion ||. Nevertheless, a flood of 
high-accuracy data will be available in the near future 



[p|-|rT|, and we shall need a comprehensive formalism to 
deal with them. 

In the literature, the most commonly used statistics for 
the NG test are moments, cumulants, n-point correlation 
functions, bispectra, Minkowski functionals, peak statis- 
tics, etc (for a review see, e.g., [| 12 ). These statistics 



can be defined in different statistical spaces such as the 
real, Fourier, wavelet, and eigen-spaces. Here we shall 
combine both the real and Fourier spaces to propose a 
new method that helps extract out the NG from a given 
field. With the extracted NG, it then becomes easier for 
the conventional statistics to characterize and then iden- 
tify the NG originated from different physical processes. 
B. A new method: The defining property of a Gaus- 
sian field is that the higher-order (greater than two) re- 
duced moments vanish. This means that the only infor- 
mation that any Gaussian field carries is its mean and 
the two-point correlation function, or cquivalently, the 
power spectrum in the Fourier space. Therefore we can 
refer to the mean and the power spectrum as the 'Gaus- 
sian components' of a field, and any extra information 
will indicate NG. Thus, the new method aims to noth- 
ing but 'removing' these Gaussian components. Here we 
shall use an n-dimensional field A(x) to demonstrate this 
formalism. 

First we Fourier transform A(x) to obtain A(k) = 
/ dx"A(x)e _lk ' x . Second, we estimate the power spec- 
trum as C k = (\A{k)\ 2 ) k /V n , where k = |k| and V n 
is the n-dimensional volume of the field. Third, we set 
A(0) = and define (Vk with C k ^ 0) 



A P (k) 



A(k)c; 1/2 P, 1/2 , 
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where P k is a given function of k. Finally, we inverse 
Fourier transform Ap(k) back to the real space Ap(x). 
Now the field Ap(x) has a mean zero and a power spec- 
trum renormalized to P k ■ Therefore the original field can 
be written as A(x) = Ap(x) ® -D(x) ®Q(x) + A, where 
an overbar denotes the mean of a field, the symbol (g) de- 
notes a convolution, and D(x) and Q(x) are the inverse 

— 1/2 

Fourier transforms of the symmetric fields D(k) = C k 

— 1/2 

and Q(k) = P k respectively. We shall investigate the 
simplest case where P k = 1, and leave the discussion for 
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(c) 



other forms of Pk later. In this case, the field A(x) will 
be 'whitened' in Fourier power through the above proce- 
dure, and we shall use a superscript 'W to denote such 
whitened fields. In the real space, this means 



A(x) = A w (x) ® D(x) + A. 



(2) 



Now the Gaussian components, A and D, are separated 
from the rest. Therefore if A is Gaussian, then all sam- 
ples in A w should appear as uncorrelated white noise. 
Otherwise A w would contain all the NG features. 

We note that the above new method (eq. ffl) is equiv- 
alent to the matrix manipulation dp = P^ 2 C _1 / 2 d, 
where d = A, and P and C are the two-point corre- 
lation matrices specified by Pk and Ck respectively. In 
signal processing, it is common to model C as a linear 
sum of the signal and noise, i.e. C = S + N. Combined 
with the Wiener filtering d WF = SC _1 d, the whitening 
procedure is then d w = S 1 / 2 C _1 d. It not only removes 
the power on scales where the noise dominates, but also 
equalizes the power on scales where the signal dominates. 

To see how this formalism can work in practice, we 
consider a linear sum of a Gaussian and a NG fields: A = 



KG) 



A(ng)- Assuming no correlation exists between 



A( G ) and A( N q), as in normal situations, we have Ck 



a 



fe(G) 



C k (NG) , leading to 



°fc(G) ^(G) + ° 



1/2 



fc(NG) "(NG) 



fc(G) 



' Cfe(NG) 



1/2 



(3) 



Thus we see that if Ck(G) < Cfc(NG) &t a certain k, the 
NG part A^ G ^ will dominate. As a consequence, we 
expect the NG feature to show up in A w as long as the 
NG signal dominates within a certain range of k. As we 
shall see later, in most cases the NG features in A w are 
already largely visually recognizable. Therefore, instead 
of trying further statistics to characterize our A w , we 
shall concentrate more on the use of this formalism, in 
the face of various challenging situations. 
C. Against various NG components: We now ap- 
ply this new formalism to the CMB anisotropics A(x) = 
[T(x) - T] /T, where T(x) is the CMB temperature at 
the direction x. In the 'small-field limit' where the curva- 
ture of the sky can be ignored, the conventional multipolc 
decomposition A(x) = Xij m a <™^m( x ) simplifies to a 
two-dimensional Fourier transform such that the multi- 
pole number is simply £ = |k| = k and the angular power 
spectrum becomes Ct = (\ai m \ 2 ) = Ck- We shall work in 
this small-field limit throughout the paper. First we sim- 
ulate a CMB field of size 1° x 1° and resolution 256 2 , with 
six components mimicking results of different physical 
processes (see Fig. Q): (a) 10 filled circles of diameters 64 
and 128 grids, (b) 10 rectangles of size 32 x 48 grid 2 . The 
sharp edges of components (a) and (b) are similar to the 
integrated Sache-Wolf (ISW) effect produced by cosmic 




FIG. 1. Six components in a simulated CMB map of 1° x 1° . 



strings |13| . (c) 5 sharp circular points (like distant galax- 
ies), each with a power spectrum Ck( c ) = exp [— (ri?fc) 2 ] , 
where R is the angular size of the field (1° here) and 
r = 0.005. (d) 5 diffuse circular points (like cosmic 
defects such as textures [Q), each with a power spec- 
trum Ck(d) = [1 + (si?fc)'] _1 where s = 0.5 and t = 6. 
(c) a Gaussian background (like the inflationary pertur- 



bations) of power spectrum Ck en k 2 exp — (j^h) 



>1200; 

which fits well on small scales (k ^> 1) with that of a stan- 
dard CDM model computed from CMBFAST [||. Even 
in the cosmic-defect-dominated models, this Gaussian 
background is still expected as the contribution from be- 
fore the last scattering epoch t\ s . This is because, in any 
cases, the sub-degree perturbations are smeared away due 



to the photon diffusion damping by t\ s |16| (thus creating 
the exponentially decaying power as seen above), while 
the super-degree perturbations must be nearly Gaus- 
sian due to causality and the CLT (the angular size of 
the horizon at t\ s is about one degree), (f) Gaussian 
noise (like instrumental effects) of white power spectrum. 
Thus the components (a)-(d) are non-Gaussian, while 
(c) and (f) are Gaussian. Fig. || shows the power spec- 
tra of each components. These six components are then 
linearly added up A — X^A^, with RMS ratios (a-f) 
1 : 1 : 10 : 500 : 1000 : 0.2 (see Fig. | left). We then 
apply our new method to obtain the whitened field A w 
(see Fig. || right). 

The above results indicate three things. First, al- 
though the NG components (A a -Ad) dominate the 
Gaussian ones (A c and Af) in power only within a cer- 
tain range of k, their NG features are almost fully recov- 
ered in A w . This confirms the expectation associated 
with equation (||) and its context. Second, the centers of 
the diffuse points in Ad are now fully recovered in A w , 
although hardly recognizable in Ad (see Fig. |l| (d)). Fi- 
nally, although A c and Ad are much stronger than A a 
and Ab in power on all the scales where the NG com- 
ponents dominate, the NG features of A a and Ab are 
still fully recovered in A w . This is essentially because 
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FIG. 2. Power spectra of the six components in Fig. pj. 




FIG. 3. Left: a simulated CMB map A, with the six com- 
ponents presented in Fig. |l| linearly added up. Right: the 
square of the extracted NG signal (A w ) 2 . The color scheme 
in the left plot is between the minimum and the maximum 
while that in the right plot is from zero to one sigma square. 



the intrinsic NG features of these NG components are 
uncorrelated. 

We also tested our method against the CLT, which 
many statistics in the literature suffer from, especially 
for those designed in the Fourier space (e.g., [jl7|). To do 
this, we considered the linear sum A = A(q) + A(ng) of a 
Gaussian background A(q) and 200 randomly distributed 
filled rectangles Anvjc) ( or diffuse points), with an RMS 
ratio cr(G)/ cr (NG) = 10 6 - In the resulting A w , the shapes 
of the 200 rectangles (or the centers of the diffuse points) 
are clearly recovered. Thus we have seen a treatment 
which combines both the real and Fourier spaces to take 
the advantages of both, without being affected by most 
of their disadvantages. 

D. Searching for cosmic strings: We first simulate a 
CMB field of 2° x 2° with a resolution of 256 2 , as observed 
from an interferometric experiment: 



A = [(A bg + A pnt + A 



SISW 



W + A E 



(4) 



Here Ab g is the Gaussian background (like the A e in 
the previous section), A pnt is the point source (Fig. [|(a), 
like the A c in the previous section but with r = 0.01 




FIG. 4. Simulated CMB maps of 2° x 2 



joint sources 



A pnt ; (b) cosmic-string-induced CMB Asisw; (c) the total 
field A; (d) the extracted NG signal A w . The color schemes 
here are within the three-sigma range. 



here), and Asisw is the string-induced CMB (Fig. 0(b)). 
The Asisw is obtained using a toy model [|l8|,[l9| which 
incorporates most important properties of cosmic strings 
such as self-avoiding and scaling. Ab g , A pnt and Asisw 
have RMS ratios 5:2:1. W t with i = 'p' and 'o' 
denote respectively the primary and observing windows 
of a Gaussian form, with Full Widths at Half Maximum 
of 0.7 and 0.02 in units of the field size. A no i is a 5% 
noise. 

The resulting A and the whitened field A w are shown 
in Fig. ^ as (c) and (d) respectively. Fig. || shows the 
power spectra of various components in A, with the con- 
volution effect from W included. As we can see, al- 
though the Asisw is hardly recognizable in A and its 
power (the dashed line in Fig. ||) is dominated by other 
components on all scales, its NG feature (the filament 
structure) is still fully recovered in A w , even with the 
fact that the amplitude of the A pn t in A w is eighty 
times the RMS of A w . We have also applied our new 
method to simulations of single-dish experiments, where 
A = [(A bg + Ap n t + Asisw) ® W Q ] VF p + A noi . The main 
conclusions remain unchanged. In general we expect the 
string-induced NG feature to show up towards smaller 
scales, if not severely obscured by the noise. This is be- 
cause the Cfc(bg) must have an exponentially decaying 
tail due to the photon diffusion damping Jlq] while the 
Cfc(SiSW) must have a power-law decay due to its step- 
function like anisotropies on small scales. 
E. Discussion and conclusion: We proposed a new 
method of extracting the NG features of a given field. Us- 
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FIG. 5. Power spectra of various components in Fig. |4| (c). 
The W 2 (fc) is the power spectrum of the observing window. 

ing the CMB anisotropies as an example, we numerically 
justified this method under various challenging circum- 
stances. In particular we show its capability of detecting 
cosmic strings. With even more tests that are similar to 
those implemented above, the main observation remains 
the same: in a field A = A( G ) + A( NG ), regardless how 
stronger the RMS of A( G ) is, the NG features of A( NG ) 
can always show up in the whitened field A w as long as 
Cfe(NG) dominates Ck(G) within a certain range of fc, even 
if it is only a tiny fraction of the total accessible k range. 
In the extracted NG field A w , the NG features of un- 
correlated NG components do not mix up even if some 
of them dominate the others in power. We also found 
that the new method is insensitive to the CLT. With 
such whitened fields, we can then further apply conven- 
tional statistics to, for example, locate the NG scale. Al- 
though some parameters in our simulations may not be 
realistic, we chose these values only for the convenience 
of demonstrating the power of our method under vari- 
ous situations. The generalization of this method for the 
treatment of a full spherical CMB sky is straightforward 
and will be investigated in more details elsewhere Jl8| . It 
should be also noted that our formalism is similar to the 
pre-whitening process commonly used in various context 
of data analysis, but applied as a tool for extracting NG 
features for the first time here. 

We also notice that according to equation ([!]), in prin- 
ciple we can design a 'window function' P^ to remove 
the power on scales where the Gaussian signals of a field 
dominate, so that the resulting Ap will have very well 
amplified NG feature. However, in general we do not 
know on which scale the NG signal dominates. Thus 
the method proposed here, which effectively equalizes the 
power on all scales, is optimal when no priori knowledge 
about a given field is available. This may even enable 
us to find the NG signals of unknown physical processes. 
The use of the Pk in our formalism for various purposes 
will be further investigated elsewhere [Q~8| . In conclusion, 



we expect the new formalism to be useful not only for the 
CMB observations in the near future, but also for other 
fields of science where identifying the NG components in 
a given field is important. 
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